Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 3002 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 400.7 KiB |
| Average record size in memory | 136.7 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 1 |
product_category_name_english has a high cardinality: 71 distinct values | High cardinality |
encodedCategory has 49 (1.6%) zeros | Zeros |
DiscountPercent has 1326 (44.2%) zeros | Zeros |
SaleRecord_last6week has 88 (2.9%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-27 18:50:20.835932 |
|---|---|
| Analysis finished | 2020-09-27 18:51:09.866087 |
| Duration | 49.03 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
Week
Real number (ℝ≥0)
| Distinct | 52 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.77148568 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 13 |
| median | 25 |
| Q3 | 38 |
| 95-th percentile | 49 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.71682319 |
|---|---|
| Coefficient of variation (CV) | 0.5710506322 |
| Kurtosis | -1.145359379 |
| Mean | 25.77148568 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.0682075063 |
| Sum | 77366 |
| Variance | 216.5848848 |
| Monotocity | Increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 31 | 64 | 2.1% | |
| 19 | 64 | 2.1% | |
| 32 | 64 | 2.1% | |
| 33 | 64 | 2.1% | |
| 16 | 63 | 2.1% | |
| 14 | 63 | 2.1% | |
| 34 | 63 | 2.1% | |
| 13 | 63 | 2.1% | |
| 23 | 63 | 2.1% | |
| 29 | 63 | 2.1% | |
| 12 | 63 | 2.1% | |
| 28 | 62 | 2.1% | |
| 26 | 62 | 2.1% | |
| 8 | 62 | 2.1% | |
| 15 | 62 | 2.1% | |
| 21 | 61 | 2.0% | |
| 24 | 61 | 2.0% | |
| 17 | 61 | 2.0% | |
| 11 | 61 | 2.0% | |
| 30 | 61 | 2.0% | |
| 3 | 60 | 2.0% | |
| 22 | 60 | 2.0% | |
| 20 | 60 | 2.0% | |
| 18 | 60 | 2.0% | |
| 2 | 60 | 2.0% | |
| Other values (27) | 1452 | 48.4% |
| Value | Count | Frequency (%) | |
| 1 | 56 | 1.9% | |
| 2 | 60 | 2.0% | |
| 3 | 60 | 2.0% | |
| 4 | 57 | 1.9% | |
| 5 | 56 | 1.9% | |
| 6 | 57 | 1.9% | |
| 7 | 59 | 2.0% | |
| 8 | 62 | 2.1% | |
| 9 | 56 | 1.9% | |
| 10 | 58 | 1.9% |
| Value | Count | Frequency (%) | |
| 52 | 45 | 1.5% | |
| 51 | 50 | 1.7% | |
| 50 | 53 | 1.8% | |
| 49 | 56 | 1.9% | |
| 48 | 58 | 1.9% | |
| 47 | 54 | 1.8% | |
| 46 | 55 | 1.8% | |
| 45 | 50 | 1.7% | |
| 44 | 52 | 1.7% | |
| 43 | 46 | 1.5% |
| Distinct | 71 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.6 KiB |
| office_furniture | 52 |
|---|---|
| food | 52 |
| furniture_decor | 52 |
| housewares | 52 |
| sports_leisure | 52 |
| Other values (66) |
| Value | Count | Frequency (%) | |
| office_furniture | 52 | 1.7% | |
| food | 52 | 1.7% | |
| furniture_decor | 52 | 1.7% | |
| housewares | 52 | 1.7% | |
| sports_leisure | 52 | 1.7% | |
| cool_stuff | 52 | 1.7% | |
| pet_shop | 52 | 1.7% | |
| musical_instruments | 52 | 1.7% | |
| electronics | 52 | 1.7% | |
| consoles_games | 52 | 1.7% | |
| watches_gifts | 52 | 1.7% | |
| telephony | 52 | 1.7% | |
| computers_accessories | 52 | 1.7% | |
| garden_tools | 52 | 1.7% | |
| baby | 52 | 1.7% | |
| stationery | 52 | 1.7% | |
| perfumery | 52 | 1.7% | |
| market_place | 52 | 1.7% | |
| toys | 52 | 1.7% | |
| audio | 52 | 1.7% | |
| health_beauty | 52 | 1.7% | |
| fashion_bags_accessories | 52 | 1.7% | |
| auto | 52 | 1.7% | |
| bed_bath_table | 52 | 1.7% | |
| small_appliances | 52 | 1.7% | |
| Other values (46) | 1702 | 56.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 39 |
|---|---|
| Median length | 15 |
| Mean length | 15.6538974 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 4195 | 8.9% | |
| e | 4106 | 8.7% | |
| s | 3987 | 8.5% | |
| _ | 3584 | 7.6% | |
| t | 3326 | 7.1% | |
| n | 3193 | 6.8% | |
| i | 3089 | 6.6% | |
| r | 2977 | 6.3% | |
| a | 2870 | 6.1% | |
| c | 2493 | 5.3% | |
| u | 1935 | 4.1% | |
| l | 1863 | 4.0% | |
| h | 1364 | 2.9% | |
| d | 1271 | 2.7% | |
| m | 1230 | 2.6% | |
| f | 1206 | 2.6% | |
| p | 1156 | 2.5% | |
| g | 1013 | 2.2% | |
| y | 705 | 1.5% | |
| b | 697 | 1.5% | |
| k | 330 | 0.7% | |
| w | 168 | 0.4% | |
| v | 123 | 0.3% | |
| 2 | 66 | 0.1% | |
| x | 46 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 43343 | 92.2% | |
| Connector Punctuation | 3584 | 7.6% | |
| Decimal Number | 66 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 4195 | 9.7% | |
| e | 4106 | 9.5% | |
| s | 3987 | 9.2% | |
| t | 3326 | 7.7% | |
| n | 3193 | 7.4% | |
| i | 3089 | 7.1% | |
| r | 2977 | 6.9% | |
| a | 2870 | 6.6% | |
| c | 2493 | 5.8% | |
| u | 1935 | 4.5% | |
| l | 1863 | 4.3% | |
| h | 1364 | 3.1% | |
| d | 1271 | 2.9% | |
| m | 1230 | 2.8% | |
| f | 1206 | 2.8% | |
| p | 1156 | 2.7% | |
| g | 1013 | 2.3% | |
| y | 705 | 1.6% | |
| b | 697 | 1.6% | |
| k | 330 | 0.8% | |
| w | 168 | 0.4% | |
| v | 123 | 0.3% | |
| x | 46 | 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 3584 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 66 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 43343 | 92.2% | |
| Common | 3650 | 7.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 4195 | 9.7% | |
| e | 4106 | 9.5% | |
| s | 3987 | 9.2% | |
| t | 3326 | 7.7% | |
| n | 3193 | 7.4% | |
| i | 3089 | 7.1% | |
| r | 2977 | 6.9% | |
| a | 2870 | 6.6% | |
| c | 2493 | 5.8% | |
| u | 1935 | 4.5% | |
| l | 1863 | 4.3% | |
| h | 1364 | 3.1% | |
| d | 1271 | 2.9% | |
| m | 1230 | 2.8% | |
| f | 1206 | 2.8% | |
| p | 1156 | 2.7% | |
| g | 1013 | 2.3% | |
| y | 705 | 1.6% | |
| b | 697 | 1.6% | |
| k | 330 | 0.8% | |
| w | 168 | 0.4% | |
| v | 123 | 0.3% | |
| x | 46 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| _ | 3584 | 98.2% | |
| 2 | 66 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 46993 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 4195 | 8.9% | |
| e | 4106 | 8.7% | |
| s | 3987 | 8.5% | |
| _ | 3584 | 7.6% | |
| t | 3326 | 7.1% | |
| n | 3193 | 6.8% | |
| i | 3089 | 6.6% | |
| r | 2977 | 6.3% | |
| a | 2870 | 6.1% | |
| c | 2493 | 5.3% | |
| u | 1935 | 4.1% | |
| l | 1863 | 4.0% | |
| h | 1364 | 2.9% | |
| d | 1271 | 2.7% | |
| m | 1230 | 2.6% | |
| f | 1206 | 2.6% | |
| p | 1156 | 2.5% | |
| g | 1013 | 2.2% | |
| y | 705 | 1.5% | |
| b | 697 | 1.5% | |
| k | 330 | 0.7% | |
| w | 168 | 0.4% | |
| v | 123 | 0.3% | |
| 2 | 66 | 0.1% | |
| x | 46 | 0.1% |
| Distinct | 71 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.30812791 |
|---|---|
| Minimum | 0 |
| Maximum | 70 |
| Zeros | 49 |
| Zeros (%) | 1.6% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 17 |
| median | 36 |
| Q3 | 53 |
| 95-th percentile | 68 |
| Maximum | 70 |
| Range | 70 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 20.72477153 |
|---|---|
| Coefficient of variation (CV) | 0.5869688584 |
| Kurtosis | -1.207570656 |
| Mean | 35.30812791 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.01446589188 |
| Sum | 105995 |
| Variance | 429.5161552 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 69 | 52 | 1.7% | |
| 42 | 52 | 1.7% | |
| 5 | 52 | 1.7% | |
| 68 | 52 | 1.7% | |
| 66 | 52 | 1.7% | |
| 60 | 52 | 1.7% | |
| 56 | 52 | 1.7% | |
| 54 | 52 | 1.7% | |
| 36 | 52 | 1.7% | |
| 15 | 52 | 1.7% | |
| 28 | 52 | 1.7% | |
| 26 | 52 | 1.7% | |
| 20 | 52 | 1.7% | |
| 16 | 52 | 1.7% | |
| 6 | 52 | 1.7% | |
| 4 | 52 | 1.7% | |
| 7 | 52 | 1.7% | |
| 70 | 52 | 1.7% | |
| 39 | 52 | 1.7% | |
| 65 | 52 | 1.7% | |
| 63 | 52 | 1.7% | |
| 49 | 52 | 1.7% | |
| 43 | 52 | 1.7% | |
| 59 | 52 | 1.7% | |
| 53 | 52 | 1.7% | |
| Other values (46) | 1702 | 56.7% |
| Value | Count | Frequency (%) | |
| 0 | 49 | 1.6% | |
| 1 | 47 | 1.6% | |
| 2 | 44 | 1.5% | |
| 3 | 9 | 0.3% | |
| 4 | 52 | 1.7% | |
| 5 | 52 | 1.7% | |
| 6 | 52 | 1.7% | |
| 7 | 52 | 1.7% | |
| 8 | 50 | 1.7% | |
| 9 | 32 | 1.1% |
| Value | Count | Frequency (%) | |
| 70 | 52 | 1.7% | |
| 69 | 52 | 1.7% | |
| 68 | 52 | 1.7% | |
| 67 | 38 | 1.3% | |
| 66 | 52 | 1.7% | |
| 65 | 52 | 1.7% | |
| 64 | 27 | 0.9% | |
| 63 | 52 | 1.7% | |
| 62 | 44 | 1.5% | |
| 61 | 2 | 0.1% |
count_perCategory
Real number (ℝ≥0)
| Distinct | 268 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.8344437 |
|---|---|
| Minimum | 1 |
| Maximum | 457 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 9 |
| Q3 | 51 |
| 95-th percentile | 186 |
| Maximum | 457 |
| Range | 456 |
| Interquartile range (IQR) | 48 |
Descriptive statistics
| Standard deviation | 61.94179508 |
|---|---|
| Coefficient of variation (CV) | 1.595022078 |
| Kurtosis | 6.270961931 |
| Mean | 38.8344437 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 2.416990393 |
| Sum | 116581 |
| Variance | 3836.785978 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1 | 373 | 12.4% | |
| 2 | 277 | 9.2% | |
| 3 | 195 | 6.5% | |
| 4 | 167 | 5.6% | |
| 5 | 122 | 4.1% | |
| 6 | 118 | 3.9% | |
| 8 | 95 | 3.2% | |
| 7 | 89 | 3.0% | |
| 9 | 71 | 2.4% | |
| 10 | 62 | 2.1% | |
| 11 | 51 | 1.7% | |
| 13 | 49 | 1.6% | |
| 15 | 42 | 1.4% | |
| 18 | 37 | 1.2% | |
| 12 | 34 | 1.1% | |
| 14 | 33 | 1.1% | |
| 16 | 30 | 1.0% | |
| 21 | 28 | 0.9% | |
| 19 | 26 | 0.9% | |
| 20 | 22 | 0.7% | |
| 22 | 20 | 0.7% | |
| 24 | 19 | 0.6% | |
| 32 | 19 | 0.6% | |
| 27 | 19 | 0.6% | |
| 54 | 18 | 0.6% | |
| Other values (243) | 986 | 32.8% |
| Value | Count | Frequency (%) | |
| 1 | 373 | 12.4% | |
| 2 | 277 | 9.2% | |
| 3 | 195 | 6.5% | |
| 4 | 167 | 5.6% | |
| 5 | 122 | 4.1% | |
| 6 | 118 | 3.9% | |
| 7 | 89 | 3.0% | |
| 8 | 95 | 3.2% | |
| 9 | 71 | 2.4% | |
| 10 | 62 | 2.1% |
| Value | Count | Frequency (%) | |
| 457 | 1 | < 0.1% | |
| 389 | 1 | < 0.1% | |
| 378 | 1 | < 0.1% | |
| 377 | 1 | < 0.1% | |
| 367 | 1 | < 0.1% | |
| 360 | 1 | < 0.1% | |
| 351 | 1 | < 0.1% | |
| 344 | 1 | < 0.1% | |
| 334 | 1 | < 0.1% | |
| 330 | 1 | < 0.1% |
paymentValue
Real number (ℝ≥0)
| Distinct | 2953 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 207.0889975 |
|---|---|
| Minimum | 11.93 |
| Maximum | 12164.26556 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 11.93 |
|---|---|
| 5-th percentile | 48.01391667 |
| Q1 | 97.12510442 |
| median | 145.6151679 |
| Q3 | 211.3764286 |
| 95-th percentile | 532.2223333 |
| Maximum | 12164.26556 |
| Range | 12152.33556 |
| Interquartile range (IQR) | 114.2513241 |
Descriptive statistics
| Standard deviation | 339.556405 |
|---|---|
| Coefficient of variation (CV) | 1.63966415 |
| Kurtosis | 540.8834935 |
| Mean | 207.0889975 |
| Median Absolute Deviation (MAD) | 54.80801445 |
| Skewness | 17.88803781 |
| Sum | 621681.1705 |
| Variance | 115298.5522 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 78.7 | 4 | 0.1% | |
| 75.07 | 3 | 0.1% | |
| 50.71 | 3 | 0.1% | |
| 74.94 | 3 | 0.1% | |
| 28 | 3 | 0.1% | |
| 49.92 | 3 | 0.1% | |
| 114.44 | 3 | 0.1% | |
| 23.29 | 2 | 0.1% | |
| 84.79 | 2 | 0.1% | |
| 124.52 | 2 | 0.1% | |
| 40.07 | 2 | 0.1% | |
| 148.97 | 2 | 0.1% | |
| 43.6 | 2 | 0.1% | |
| 112.62 | 2 | 0.1% | |
| 92.29 | 2 | 0.1% | |
| 482.69 | 2 | 0.1% | |
| 312.49 | 2 | 0.1% | |
| 109.5 | 2 | 0.1% | |
| 793.09 | 2 | 0.1% | |
| 122.22 | 2 | 0.1% | |
| 111.89 | 2 | 0.1% | |
| 108.44 | 2 | 0.1% | |
| 29.69 | 2 | 0.1% | |
| 73.13 | 2 | 0.1% | |
| 58.23 | 2 | 0.1% | |
| Other values (2928) | 2944 | 98.1% |
| Value | Count | Frequency (%) | |
| 11.93 | 1 | < 0.1% | |
| 11.97 | 1 | < 0.1% | |
| 17.29666667 | 1 | < 0.1% | |
| 18.62 | 1 | < 0.1% | |
| 18.79 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 19.39 | 1 | < 0.1% | |
| 20.21 | 1 | < 0.1% | |
| 22.29 | 1 | < 0.1% | |
| 22.765 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 12164.26556 | 1 | < 0.1% | |
| 4862.396667 | 1 | < 0.1% | |
| 3802.595 | 1 | < 0.1% | |
| 3762.738182 | 1 | < 0.1% | |
| 3602.47 | 1 | < 0.1% | |
| 2960.05 | 1 | < 0.1% | |
| 2719.7725 | 1 | < 0.1% | |
| 2590.6 | 1 | < 0.1% | |
| 2361.506667 | 1 | < 0.1% | |
| 2217.99 | 1 | < 0.1% |
final_AmountPaid
Real number (ℝ≥0)
| Distinct | 2940 |
|---|---|
| Distinct (%) | 97.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 173.426903 |
|---|---|
| Minimum | 16.35 |
| Maximum | 3802.595 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 16.35 |
|---|---|
| 5-th percentile | 43.6345 |
| Q1 | 87.492 |
| median | 125.497 |
| Q3 | 182.5445833 |
| 95-th percentile | 444.158825 |
| Maximum | 3802.595 |
| Range | 3786.245 |
| Interquartile range (IQR) | 95.05258333 |
Descriptive statistics
| Standard deviation | 210.1504397 |
|---|---|
| Coefficient of variation (CV) | 1.211752249 |
| Kurtosis | 79.6100343 |
| Mean | 173.426903 |
| Median Absolute Deviation (MAD) | 45.11972222 |
| Skewness | 7.009777044 |
| Sum | 520627.5628 |
| Variance | 44163.20731 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 28 | 5 | 0.2% | |
| 61.11 | 4 | 0.1% | |
| 78.7 | 4 | 0.1% | |
| 75.07 | 3 | 0.1% | |
| 71.82 | 3 | 0.1% | |
| 114.44 | 3 | 0.1% | |
| 74.94 | 3 | 0.1% | |
| 50.71 | 3 | 0.1% | |
| 79.17 | 3 | 0.1% | |
| 49.92 | 3 | 0.1% | |
| 58.23 | 3 | 0.1% | |
| 108.44 | 2 | 0.1% | |
| 482.69 | 2 | 0.1% | |
| 120.875 | 2 | 0.1% | |
| 111.62 | 2 | 0.1% | |
| 45.53 | 2 | 0.1% | |
| 70.34 | 2 | 0.1% | |
| 84.79 | 2 | 0.1% | |
| 56.15 | 2 | 0.1% | |
| 84.63 | 2 | 0.1% | |
| 54.94 | 2 | 0.1% | |
| 24.75 | 2 | 0.1% | |
| 37.69 | 2 | 0.1% | |
| 109.5 | 2 | 0.1% | |
| 43.6 | 2 | 0.1% | |
| Other values (2915) | 2937 | 97.8% |
| Value | Count | Frequency (%) | |
| 16.35 | 1 | < 0.1% | |
| 18.62 | 1 | < 0.1% | |
| 18.79 | 1 | < 0.1% | |
| 18.82 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 19.13 | 1 | < 0.1% | |
| 19.39 | 1 | < 0.1% | |
| 20.21 | 1 | < 0.1% | |
| 20.41 | 1 | < 0.1% | |
| 21.31 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3802.595 | 1 | < 0.1% | |
| 3602.47 | 1 | < 0.1% | |
| 2960.05 | 1 | < 0.1% | |
| 2217.99 | 1 | < 0.1% | |
| 2114.63 | 1 | < 0.1% | |
| 1887.143333 | 1 | < 0.1% | |
| 1686.8875 | 1 | < 0.1% | |
| 1685.48 | 1 | < 0.1% | |
| 1647.408 | 1 | < 0.1% | |
| 1576.776667 | 1 | < 0.1% |
Review_score
Real number (ℝ≥0)
| Distinct | 987 |
|---|---|
| Distinct (%) | 32.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.046743344 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.700714286 |
| Q1 | 3.769230769 |
| median | 4.133333333 |
| Q3 | 4.5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 0.7307692308 |
Descriptive statistics
| Standard deviation | 0.7722079833 |
|---|---|
| Coefficient of variation (CV) | 0.1908220803 |
| Kurtosis | 3.942878763 |
| Mean | 4.046743344 |
| Median Absolute Deviation (MAD) | 0.3666666667 |
| Skewness | -1.594663883 |
| Sum | 12148.32352 |
| Variance | 0.5963051694 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5 | 428 | 14.3% | |
| 4 | 212 | 7.1% | |
| 4.5 | 112 | 3.7% | |
| 3 | 109 | 3.6% | |
| 4.333333333 | 72 | 2.4% | |
| 1 | 64 | 2.1% | |
| 3.5 | 55 | 1.8% | |
| 4.666666667 | 55 | 1.8% | |
| 3.666666667 | 44 | 1.5% | |
| 4.25 | 37 | 1.2% | |
| 4.2 | 30 | 1.0% | |
| 3.75 | 27 | 0.9% | |
| 4.75 | 26 | 0.9% | |
| 3.333333333 | 26 | 0.9% | |
| 2 | 23 | 0.8% | |
| 4.4 | 21 | 0.7% | |
| 3.833333333 | 20 | 0.7% | |
| 4.8 | 20 | 0.7% | |
| 2.5 | 19 | 0.6% | |
| 3.6 | 17 | 0.6% | |
| 4.6 | 15 | 0.5% | |
| 3.8 | 15 | 0.5% | |
| 3.857142857 | 14 | 0.5% | |
| 4.428571429 | 14 | 0.5% | |
| 4.375 | 13 | 0.4% | |
| Other values (962) | 1514 | 50.4% |
| Value | Count | Frequency (%) | |
| 1 | 64 | 2.1% | |
| 1.333333333 | 1 | < 0.1% | |
| 1.4 | 1 | < 0.1% | |
| 1.444444444 | 1 | < 0.1% | |
| 1.5 | 3 | 0.1% | |
| 1.571428571 | 1 | < 0.1% | |
| 1.666666667 | 1 | < 0.1% | |
| 1.75 | 3 | 0.1% | |
| 1.888888889 | 2 | 0.1% | |
| 2 | 23 | 0.8% |
| Value | Count | Frequency (%) | |
| 5 | 428 | 14.3% | |
| 4.962962963 | 1 | < 0.1% | |
| 4.933333333 | 1 | < 0.1% | |
| 4.909090909 | 1 | < 0.1% | |
| 4.9 | 1 | < 0.1% | |
| 4.888888889 | 6 | 0.2% | |
| 4.875 | 7 | 0.2% | |
| 4.866666667 | 1 | < 0.1% | |
| 4.857142857 | 7 | 0.2% | |
| 4.846153846 | 1 | < 0.1% |
| Distinct | 1647 |
|---|---|
| Distinct (%) | 54.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.03780588 |
|---|---|
| Minimum | 0 |
| Maximum | 90.60211321 |
| Zeros | 1326 |
| Zeros (%) | 44.2% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2.732264642 |
| Q3 | 19.59205334 |
| 95-th percentile | 50 |
| Maximum | 90.60211321 |
| Range | 90.60211321 |
| Interquartile range (IQR) | 19.59205334 |
Descriptive statistics
| Standard deviation | 17.44392128 |
|---|---|
| Coefficient of variation (CV) | 1.449094748 |
| Kurtosis | 2.627684847 |
| Mean | 12.03780588 |
| Median Absolute Deviation (MAD) | 2.732264642 |
| Skewness | 1.714593936 |
| Sum | 36137.49326 |
| Variance | 304.2903897 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 1326 | 44.2% | |
| 50 | 19 | 0.6% | |
| 80 | 4 | 0.1% | |
| 75 | 4 | 0.1% | |
| 50 | 3 | 0.1% | |
| 1.196602788e-14 | 2 | 0.1% | |
| 50 | 2 | 0.1% | |
| 1.53980439e-14 | 2 | 0.1% | |
| 66.66666667 | 2 | 0.1% | |
| 32.53431038 | 1 | < 0.1% | |
| 60.63918707 | 1 | < 0.1% | |
| 32.9022709 | 1 | < 0.1% | |
| 17.38245212 | 1 | < 0.1% | |
| 58.03874507 | 1 | < 0.1% | |
| 22.45050396 | 1 | < 0.1% | |
| 43.43160499 | 1 | < 0.1% | |
| 12.64322798 | 1 | < 0.1% | |
| 28.08097889 | 1 | < 0.1% | |
| 25.20216401 | 1 | < 0.1% | |
| 28.52112441 | 1 | < 0.1% | |
| 16.16200879 | 1 | < 0.1% | |
| 26.46375395 | 1 | < 0.1% | |
| 21.39007505 | 1 | < 0.1% | |
| 12.60102414 | 1 | < 0.1% | |
| 56.88658248 | 1 | < 0.1% | |
| Other values (1622) | 1622 | 54.0% |
| Value | Count | Frequency (%) | |
| 0 | 1326 | 44.2% | |
| 1.126058218e-14 | 1 | < 0.1% | |
| 1.127666618e-14 | 1 | < 0.1% | |
| 1.132248802e-14 | 1 | < 0.1% | |
| 1.136177581e-14 | 1 | < 0.1% | |
| 1.152636444e-14 | 1 | < 0.1% | |
| 1.165015143e-14 | 1 | < 0.1% | |
| 1.196602788e-14 | 2 | 0.1% | |
| 1.201357234e-14 | 1 | < 0.1% | |
| 1.201892157e-14 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 90.60211321 | 1 | < 0.1% | |
| 88.15301696 | 1 | < 0.1% | |
| 87.36752523 | 1 | < 0.1% | |
| 86.28530032 | 1 | < 0.1% | |
| 83.40245588 | 1 | < 0.1% | |
| 83.33333333 | 1 | < 0.1% | |
| 83.0311319 | 1 | < 0.1% | |
| 81.29429476 | 1 | < 0.1% | |
| 81.08108108 | 1 | < 0.1% | |
| 81.03284213 | 1 | < 0.1% |
| Distinct | 772 |
|---|---|
| Distinct (%) | 25.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.79524761 |
|---|---|
| Minimum | 0 |
| Maximum | 323.8333333 |
| Zeros | 88 |
| Zeros (%) | 2.9% |
| Memory size | 23.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.3333333333 |
| Q1 | 2.833333333 |
| median | 8.333333333 |
| Q3 | 48.25 |
| 95-th percentile | 183.9916667 |
| Maximum | 323.8333333 |
| Range | 323.8333333 |
| Interquartile range (IQR) | 45.41666667 |
Descriptive statistics
| Standard deviation | 58.65196638 |
|---|---|
| Coefficient of variation (CV) | 1.594009286 |
| Kurtosis | 4.699429641 |
| Mean | 36.79524761 |
| Median Absolute Deviation (MAD) | 7.333333333 |
| Skewness | 2.230940001 |
| Sum | 110459.3333 |
| Variance | 3440.05316 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 88 | 2.9% | |
| 1.166666667 | 52 | 1.7% | |
| 1 | 51 | 1.7% | |
| 0.8333333333 | 51 | 1.7% | |
| 0.6666666667 | 45 | 1.5% | |
| 0.3333333333 | 44 | 1.5% | |
| 0.5 | 42 | 1.4% | |
| 2.166666667 | 41 | 1.4% | |
| 3.166666667 | 41 | 1.4% | |
| 1.666666667 | 40 | 1.3% | |
| 1.333333333 | 40 | 1.3% | |
| 1.833333333 | 40 | 1.3% | |
| 2.333333333 | 39 | 1.3% | |
| 2.666666667 | 37 | 1.2% | |
| 1.5 | 37 | 1.2% | |
| 3 | 35 | 1.2% | |
| 0.1666666667 | 35 | 1.2% | |
| 2.833333333 | 33 | 1.1% | |
| 6.166666667 | 32 | 1.1% | |
| 3.333333333 | 32 | 1.1% | |
| 2 | 32 | 1.1% | |
| 3.666666667 | 28 | 0.9% | |
| 3.5 | 26 | 0.9% | |
| 4 | 26 | 0.9% | |
| 4.333333333 | 26 | 0.9% | |
| Other values (747) | 2009 | 66.9% |
| Value | Count | Frequency (%) | |
| 0 | 88 | 2.9% | |
| 0.1666666667 | 35 | 1.2% | |
| 0.3333333333 | 44 | 1.5% | |
| 0.5 | 42 | 1.4% | |
| 0.6666666667 | 45 | 1.5% | |
| 0.8333333333 | 51 | 1.7% | |
| 1 | 51 | 1.7% | |
| 1.166666667 | 52 | 1.7% | |
| 1.333333333 | 40 | 1.3% | |
| 1.5 | 37 | 1.2% |
| Value | Count | Frequency (%) | |
| 323.8333333 | 1 | < 0.1% | |
| 322.8333333 | 1 | < 0.1% | |
| 308.1666667 | 1 | < 0.1% | |
| 301.5 | 1 | < 0.1% | |
| 298.5 | 1 | < 0.1% | |
| 289.6666667 | 1 | < 0.1% | |
| 288.1666667 | 1 | < 0.1% | |
| 287.3333333 | 1 | < 0.1% | |
| 287 | 1 | < 0.1% | |
| 286 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Week | product_category_name_english | encodedCategory | count_perCategory | paymentValue | final_AmountPaid | Review_score | DiscountPercent | SaleRecord_last6week | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | agro_industry_and_commerce | 0 | 4 | 624.422500 | 624.422500 | 3.250000 | 0.000000 | 0.0 |
| 1 | 1 | air_conditioning | 1 | 2 | 240.350000 | 240.350000 | 3.000000 | 0.000000 | 0.0 |
| 2 | 1 | art | 2 | 3 | 149.023333 | 149.023333 | 4.000000 | 0.000000 | 0.0 |
| 3 | 1 | audio | 4 | 2 | 432.745000 | 432.745000 | 3.000000 | 0.000000 | 0.0 |
| 4 | 1 | auto | 5 | 57 | 179.637193 | 155.896316 | 4.315789 | 13.216014 | 0.0 |
| 5 | 1 | baby | 6 | 51 | 166.258235 | 144.163529 | 3.686275 | 13.289390 | 0.0 |
| 6 | 1 | bed_bath_table | 7 | 186 | 127.048656 | 109.096559 | 3.940860 | 14.130096 | 0.0 |
| 7 | 1 | books_general_interest | 8 | 9 | 58.418889 | 53.593333 | 4.666667 | 8.260266 | 0.0 |
| 8 | 1 | books_technical | 10 | 1 | 296.360000 | 296.360000 | 5.000000 | 0.000000 | 0.0 |
| 9 | 1 | christmas_supplies | 12 | 2 | 104.165000 | 104.165000 | 5.000000 | 0.000000 | 0.0 |
Last rows
| Week | product_category_name_english | encodedCategory | count_perCategory | paymentValue | final_AmountPaid | Review_score | DiscountPercent | SaleRecord_last6week | |
|---|---|---|---|---|---|---|---|---|---|
| 2992 | 52 | office_furniture | 57 | 4 | 299.237500 | 299.237500 | 3.000000 | 0.000000e+00 | 17.833333 |
| 2993 | 52 | perfumery | 59 | 35 | 134.550571 | 132.842000 | 3.914286 | 1.269836e+00 | 83.666667 |
| 2994 | 52 | pet_shop | 60 | 13 | 115.799231 | 115.799231 | 4.692308 | 0.000000e+00 | 24.166667 |
| 2995 | 52 | signaling_and_security | 62 | 1 | 719.150000 | 719.150000 | 5.000000 | 0.000000e+00 | 0.666667 |
| 2996 | 52 | small_appliances | 63 | 2 | 297.710000 | 297.710000 | 5.000000 | 1.909355e-14 | 9.333333 |
| 2997 | 52 | sports_leisure | 65 | 60 | 142.936667 | 134.125667 | 4.033333 | 6.164269e+00 | 152.666667 |
| 2998 | 52 | stationery | 66 | 54 | 107.755000 | 115.309815 | 4.092593 | 0.000000e+00 | 53.500000 |
| 2999 | 52 | telephony | 68 | 58 | 55.685862 | 50.249828 | 4.379310 | 9.761965e+00 | 85.666667 |
| 3000 | 52 | toys | 69 | 37 | 191.202703 | 186.727568 | 4.027027 | 2.340519e+00 | 141.333333 |
| 3001 | 52 | watches_gifts | 70 | 54 | 202.143519 | 193.216111 | 3.888889 | 4.416371e+00 | 108.333333 |